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(54) Method for duplicating data of storage subsystem and data duplicating system 



(57) In order to realize asynchronous type system 
assuring the consistency of data with the function of disk 
subsystems without the need of Introducing new soft- 
ware to a host unit and without the deterioration of the 
performance of a main center, in a remote copy system 
which copies the data to the disk subsystems of the re- 
mote center for duplicating the data in the disk subsys- 
tems of the main center, the disk subsystems give serial 
numbers and times to the data together with writing said 



data to the storage devices in the disk subsystem and 
transfer said data to the other disk subsystems, and the 
other disk subsystems arrange the two or more data in 
the sequence of the serial numbers, decide the oldest 
time among the latest time given to each of the disk sub- 
systems communicating among the disk subsystems 
and the data given with the time not later than the de- 
cided oldest time are the objects of writing to each of the 
disk storage devices. 



FIG.1 



Host unft J 



3-1 



Disk 
subaystem 



Disk 
subsystem 

3-2 



— t Disk : 
\ subsystem 

3-n 



r 

4-1 



4-n 



8_ Host unit 



7-1 



Disk 



f- subsystem 



Disk 
subsystem 



7-2 



\ Disk sub- _ 
" system '\ 

7-n 



Remote center 



10 



Printed by Jouve. 75001 PARIS (FR) 



1 



EP1 150 210 A2 



2 



Description 

[ Background of the invention ] 

[ Field of the Invention ] s 

[0001 ] The present invention relates to a storage sub- 
system storing data for a computer to refer to and to up- 
date, and more particularly to the method of duplicating 
the data held by a storage subsystem and a data dupli- 
cating system. 

[ Description of the Prior Art ] 

[0002] Companies and financial institutions are 
known to have both computer centers and backup com- 
puter centers preparing for the disasters of earthquake, 
etc. In systems which have backup facilities, the distanc- 
es between the main computer centers and the remote 
computer centers are from a hundred kilometers to sev- 
eral hundreds kilometers apart geometrically, and the 
data are held in disk subsystems installed in the main 
centers and the remote centers with the duplicating. 
Systems having so called remote copy functions, 
wherein writing data generated at the disk subsystem of 
the main center are transferred to the disk subsystems 
of the remote center and the transferred same data are 
written into the disk subsystems of the remote center, 
have been already realized. The remote copy functions 
are classified into a synchronous type and an asynchro- 
nous type. 

[0003] The synchronous type means a processing 
procedure that, when an update (writing) instruction is 
issued from the host computer (host unit) in the main 
center to the disk subsystem, the completion of the up- 
date processing is reported to the host unit of the main 
center after the instructed update (writing) to the disk 
subsystem of the remote center is completed if the in- 
structed object is the object of the remote copy function. 
In this case, time delay is generated by transmission 
time caused by the performance of data transmission 
lines connecting between the main center and the re- 
mote center according to geometrical distances. 
[0004] On the contrary, the asynchronous type means 
a processing procedure that, when an update (writing) 
instruction is issued from the host unit in the main center 
to the disk subsystem, the completion of update 
processing is reported to the host unit immediately after 
the completion of update processing at the disk subsys- 
tem in the main center, if the object of the instruction is 
the object of the remote copy function, and the data up- 
date (reflection) to the disk subsystem of the remote 
center is executed asynchronously with the processing 
regarding to the main center. With the asynchronous 
type, the data update is completed within the processing 
time required in the main center and the time delay 
caused by the data storing into the remote center is not 
generated. Therefore, when the highest priority is to 



avoid the influence of the transmission time to the oper- 
ation of the main center in the remote copy between the 
distant disk subsystems, the asynchronous type remote 
copy will be more preferable than the synchronous type 
remote copy. 

[0005] With the asynchronous remote copy, the data 
of the disk subsystem of the remote center is not always 
the same with the data on the side of the main center. 
The data that has not been reflected on the side of the 
remote center may be lost when the main center loses 
the function by disasters. However, the accessing per- 
formance of the disk subsystem on the side of the main 
center can be maintained at similar level with that of the 
case where the remote copy function is not applied. 
[0006] With such prior arts, there have been kinds of 
problems for achieving the remote copy function with the 
intervention of the host unit. 

(1 ) Maintenance of the consistency of the data 

[0007] In the execution of the remote copy, an inde- 
pendent communication link connects between the disk 
subsystem of the main center and the disk subsystem 
of the remote center. That is, in the execution of the re- 
mote copy between the two or more disk subsystems of 
the main center and the two or more disk subsystems 
of the remote center, the configuration comprises two or 
more pairs of the disk subsystems connected by the in- 
dependent communication links. In performing the 
backup of the main center having two or more disk sub- 
systems by the remote center, there is a problem of 
maintaining the update sequence of the data among the 
two or more disk subsystems, that is "the maintenance 
of the consistency". In the asynchronous remote copy, 
it is unavoidable that the reflection of the updated data 
to the remote center delays from the time of actual up- 
date processing at the main center. However, the se- 
quence of the update must be in conformity with that of 
the main center. At least, at the moment a user requests 
data at the remote center, the data maintained with the 
consistency must be stored at the remote center. 
[0008] Generally, a database comprises a database 
main body, kinds of log information that record the his- 
tory of data updates, and control information, and there 
are many cases of system design where each of the da- 
tabase main body, the kinds of log information, and the 
control information are stored in different disk subsys- 
tems for the security reason. As there is a relativity 
among the database main body, the kinds of log infor- 
mation and the control information, these log informa- 
tion and control information are supplemented or updat- 
ed at the data update processing giving to that of the 
database main body for maintaining the consistency of 
the system. These series of update are executed in se- 
quence with time intervals of the order of several micro- 
seconds at the shortest. If the sequence of the update 
is confused, the consistency of the information regard- 
ing to the sequence of the update may be lost leading 
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to the total destruction of the database. 
[0009] For example, if the update of the log informa- 
tion, etc. is executed after the update of the database at 
the main center, there is a possibility of the updated log 
information, etc. arriving at the remote center before the 5 
arrival of the updated information of the database main 
body depending on the circumstance of the above de- 
scribed communication links comprising the remote 
copy system. Therefore, there is a hidden possibility of 
generating the situation where the supplementing or up- 
dating of the log information, etc. at the remote center 
is executed in advance of that of the data base main 
body at the remote center. If the main center suffers from 
disasters at the state of logical unconformity where only 
the log information, etc. are supplemented or updated 
but the database main body relating to the log informa- 
tion is not updated, the database itself of the remote 
center cannot be of use. Therefore, there is a problem 
that the update of the data at the remote center must be 
executed with the same sequence of the sequence of 
the update of the data at the main center. 

(2) Remote copy function with the intervention of a host 
unit 

[0010] In case of realizing an asynchronous remote 
copy in the circumstance of both main center and re- 
mote center with two or more disk subsystems, such 
technologies are known that the host unit at the main 
center gives information related to the update sequence 
as a time stamp to the data when the host unit at the 
main center instructs the update of the data to the disk 
subsystem, and the host unit of the remote center exe- 
cutes the reflection processing of the update of the data 
to the disk subsystem of the remote center based on 
such time information. For example, the Japanese pat- 
ent unexamined publication 6-290125 (US patent 
5446871) discloses such technology. The Japanese 
patent unexamined publication 6-290125 discloses the 
realization of a remote copy function with the interven- 
tion of a host unit. Specifically, by the cooperative oper- 
ation of the operating system and the disk subsystems 
of the host unit on the side of the man center and the 
data mover software and the disk subsystems of the 
host unit on the side of the remote center, the issue and 
the transmission of the update sequence information 
and the reflective processing of the update data based 
upon the update sequence information are realized. 

[ Summary of invention ] v 

[0011] According to above described prior art, an 
asynchronous remote copy function is realized assuring 
the update sequence of the data between the main cent- 
er and the remote center. However, this technology re- 
quires software for both of the host unit and the disk sub- 
system for realizing the asynchronous remote copy 
function and the cooperation between them. The dedi- 



cated new software has to be introduced; therefore, the 
introduction of dedicated new software, setting, inspec- 
tion, and the review of system design caused by the in- 
crease of a load to a CPU will be generated for a user. 
Therefore, there has been a sort of a barrier to introduce 
the remote copy by the prior art as it requires a certain 
amount of introducing period and the cost of introduc- 
tion. Actually, there is no realizing means of the remote 
copy by the prior art for a system, wherein the host unit 
cannot give the information regarding to the update se- 
quence, such as time stamps, or the system having no 
structure of correcting internal clocks of two or more host 
computers, that are, especially, open systems. Notwith- 
standing the type of the host unit or host software, the 
consistency of the update sequence of the data must be 
maintained with the function of the disk subsystem only, 
for realizing asynchronous remote copy with the func- 
tion of the disk subsystem only. In case the data that 
require the consistency of update sequence are dis- 
persed being stored in two or more disk subsystems, 
there is a problem of lack of means to maintain the con- 
sistency of the update sequence among the two or more 
disk subsystems. 

[001 2] The object of the present invention is to realize 
an asynchronous remote copy function assuring the 
consistency of data with the function of the storage sub- 
system only without the need of introducing new soft- 
ware to the host unit, easy to introduce, and with mini- 
mum deterioration of the performance of the computer 
at the main center. 

[0013] The present invention is the method of dupli- 
cating data of a system which is provided with a first 
storage subsystem group comprising two or more stor- 
age subsystems and a second storage subsystem 
group comprising two or more storage subsystems 
which store copies of the data of the first storage sub- 
system group, wherein the method of the duplicating the 
data is that the data being written into each of storage 
devices from each of the storage subsystems which be- 
longs to the first storage subsystem group are given with 
serial numbers and times, the data are transferred 
through transmission lines to the storage subsystems 
which belong to the second storage subsystem group, 
the two or more data being received by each of the stor- 
age subsystems which belongs to the second storage 
subsystem group are arranged in sequence of the serial 
numbers, the oldest time is decided comparing the latest 
times given to each of the storage subsystems by the 
communication among the storage subsystems which 
belong to the second storage subsystem group, and the 
data given with the times earlier than the decided oldest 
time are the objects of data writing into the storage de- 
vices of the storage subsystems. 

[ Brief description of the drawings ] 

[0014] Fig. 1 is a diagram showing the total structure 
of the remote copy system of the preferred embodiment 
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of the present invention. 

[001 5] Fig. 2 is a diagram showing the inner structure 
of the disk subsystem of the preferred embodiment of 
the present invention. 

[0016] Fig. 3 is a flowchart showing the procedure of 5 
duplicating data from a disk subsystem 3 to a disk sub- 
system 7 of the preferred embodiment of the present 
invention. 

[0017] Fig. 4 is a diagram illustrating an example of 
the flow of the data given with the time of receipt and 
the serial number of the receipt till it is being remote cop- 
ied of the preferred embodiment of the present inven- 
tion. 

[0018] Fig. 5 is a flowchart showing the procedure of 
processing by the disk subsystem 7 regarding to the set- 
tlement of the justified time of the preferred embodiment 
of the present invention. 

[0019] Fig. 6 is a flowchart showing the procedure of 
processing by the disk subsystem 7 regarding to the re- 
port of the justified time of the preferred embodiment of 
the present invention. 

[ Detailed description of the invention ] 

[0020] The system of a preferred embodiment of the 
present invention will be explained as follows, with ref- 
erence to the drawings. 

[0021] Fig. 1 is a block diagram of a system that du- 
plicates data between two data centers, each of which 
is provided with a computer. Two or more disk subsys- 
tems 3-1 , 3-2, — 3-n on the side of a main center 9 and 
two or more disk subsystems 7-1 , 7-2, — 7-n on the side 
of a remote center 10 are connected mutually without 
the intervention of host units 1 and 8, and a remote copy 
system that duplicates data between both centers is re- 
alized. As an example of the connection of the disk sub- 
systems without the intervention of the host units 1 and 
8, the connection utilizing Storage Area Network (SAN) 
can be cited, though the detail of it is not described here. 
[0022] The host unit 1 at the main center 9 having a 
central processing unit (CPU) that performs data 
processing is connected with the disk subsystems 3-1 , 
3-2, — 3-n through an interface cable 2. 
[0023] The structure comprises the disk subsystem 
3-1 being connected with the disk subsystem 7-1 of the 
remote center through an interface cable 4-1 , the disk 
subsystem 3-2 being connected with the disk subsys- 
tem 7-2 through an interface cable 4-2, and the disk sub- 
system 3-n being connected with the disk subsystem 
7-n through an interface cable 4-n similarly. Hereinafter, 
one of the disk subsystems 3-1 , 3-2, — 3-n or one of the 
disk subsystems 7-1 , 7-2, — 7-n may be referred to as a 
disk subsystems 3 or a disk subsystems 7. It will be sim- 
ilar with other structure. 

[0024] The interface cables 4-1 , 4-2,— 4-n will include 
optical fiber links driven by LED drive units and the links 
using optical fiber cables driven by an interface protocol 
generally called a fiber channel. Electric communication 



links, typical of which are a T3 network and an ATM net- 
work can be included in it. Therefore, between the disk 
subsystem 3 of the main center 9 and the disk subsys- 
tem 7 of the remote center 10 can be connected with 
general fiber channel switches, or can be connected 
with the T3 network, the SONET network, or the ATM 
fietwork. Though not shown by Fig. 1 , these are called 
as the interface cables 4 as these connecting configu- 
rations are feasible in the preferred embodiment of the 
present invention. 

[0025] Any of the disk subsystems 7-1 , 7-2, — 7-n be- 
ing stored with the data which are the object of the re- 
mote copy inside the remote center 1 0 is connected with 
another disk subsystem being stored with the data 
which are the object of the remote copy inside the same 
center through an interface cable 5. In the preferred em- 
bodiment of the present invention, the disk subsystems 

7 are formed with a loop configuration being connected 
sequentially as with connecting the disk subsystem 7-1 
with the disk subsystem 7-2, the disk subsystem 7-2 with 
the disk subsystem 7-3, and the disk subsystem 7-n with 
the disk subsystem 7-1 . 

[0026] A host unit 8 is a central processing unit which 
is connected with the disk subsystems 7-1 , 7-2, — 7-n at 
the remote center 10 through an interface cable 6 and 
executes the reference and the update to the disk sub- 
systems 7-1 , 7-2, — 7-n. The host unit 8 can execute the 
processing substituting a host unit 1 when the host unit 
1 of the main center 9 cannot execute the regular func- 
tions by disasters or failures. Furthermore, the host unit 

8 can execute the different processing from those of the 
host unit 1 of the main center 9 independent of the host 
unit 1 using the data stored in the disk subsystems 7-1 , 
7-2, — 7-n. However, the host unit 8 will not be necessary 
when the host unit 8 does not do any processing to the 
disk subsystems 7-1, 7-2, — 7-n. 

[0027] Fig. 1 shows the system configuration that 
when the host unit 1 issues the data write instruction to 
the two or more disk subsystems 3-1, 3-2, — 3-n, the 
same data are stored into the two or more disk subsys- 
tems 7-1 . 7-2, — 7-n inside the remote center 1 0 main- 
taining the consistency of the processing and the data 
update at the main center 9. Arrows in Fig. 1 indicate 
the flow of the data instructed for writing from the host 
unit 1 . 

[0028] Fig. 2 is a block diagram showing the inner 
configuration of the disk subsystem 3 and the disk sub- 
system 7. The disk subsystem 3 is provided with an In- 
terface control part 11 for transmitting and receiving the 
commands and the data transferred from the host unit 
1 and for connecting with other disk subsystems 7, a 
cache memory 12 which stores the data being referred 
to or updated by the host unit 1 , a magnetic disk drive 
13 which is a magnetic media to store such data, a con- 
trol memory 14 which stores the management informa- 
tion of the data, the status information of the remote 
copy, time information, etc., and a disk subsystem con- 
trol part 17 which controls each element of these. The 
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disk subsystem control part 17 operates by program 
control using a micro processor. 
[0029] The disk subsystem 7 has the similar internal 
constitution- The disk subsystem 3 and the disk subsys- 
tem 7 can be configured as a disk subsystem having 5 
same internal constitution and functions so that the disk 
subsystem can operate as the disk subsystem 3 under 
the local mode and operate as the disk subsystem 7 un- 
der the remote mode. Each of the operating functions 
at the local mode and the remote mode is explained as 
follows. Generally a part corresponding to the magnetic 
disk drive 13 can be a storage device other than mag- 
netic disk drive, and the disk subsystems 3 and 7 can 
be replaced with the units generally called as storage 
subsystems. 

[0030] Each of the disk subsystem 3 is provided with 
a clock 15 and a clock correcting part 1 6. The clock cor- 
recting part 16 corrects the clock 15 occasionally based 
on the time information transferred from a time server 
located inside the unit cabinet of each of the disk sub- 
system 3 or closely enough at different position from 
each of the disk subsystems and keeps the time differ- 
ence between the time that the time server transfers and 
the time of the clock 1 5 in each of the disk subsystems 
within the tolerance of several microseconds. The time 
server is an equipment provided with the function of re- 
ceiving radio waves containing time information as a 
GPS and a radio controlled clock, and the function of 
transferring and delivering the received time information 
to each of the disk subsystems. If the time server is not 
provided with the radio wave receiving function, the 
function of generating time signal by the timer itself and 
the function of transferring and delivering it to each of 
the disk subsystems 3 will substitute it. If one of the disk 
subsystems 3 is provided with the function of the time 
server, an independent time server will not be neces- 
sary. A time server 20 shown by Fig. 2 is located close 
enough to the disk subsystem 3 and provided with a 
GPS receiving part 21 which receiving the information 
including time from satellites, the time signal acquiring 
part which acquires time from the received information, 
a time signal generating structure which generates time 
signal continuously by itself when the information can- 
not be received, and a time delivering part 22 whtoh de- 
livers the time information to the disk subsystems 3. 
[0031] The frequency of executing the update 
processing to the disk subsystems 3 by the host unit 1 
which is generally called a host computer will depend 
on each of the systems but with the high frequency case, 
the update processing may be executed in succession 
at the time intervals of several microseconds. The rea- 
son why the accuracy of the clock 1 5 of each of the disk 
subsystem 3 is required to be within the range of several 
microseconds is to allow the disk subsystem 3 to se- 
curely grasp the difference of the time of the update 
processing and to grasp the sequence of the update 
processing of the data based on the time of the update 
processing tn case of the update processing is executed 
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in succession at the time intervals of several microsec- 
onds. If the frequency of the update processing which 
is executed by the host computer to the disk subsystem 
3 is with the time intervals shorter than several micro- 
seconds, then the accuracy of the clock 1 5 held by each 
of the disk subsystem 3 must be less than the time unit 
of the time interval of the processing. 
[0032] Fig. 3 is a flow chart showing the procedure of 
the processing of duplicating the data covering the disk 
subsystem 3 and the disk subsystem 7. As the initial 
condition before the following processing, the result of 
the previous addition or update of the data are reflected 
between the disk subsystem 3 and the disk subsystem 
7 and the duplication of the data must be completed. 
The host unit 1 issues the write request (hereafter called 
a write command) to the disk subsystem 3 (step 31 ). 
When the disk subsystem 3 receives the write command 
through the interface control part 11 , the disk subsystem 
control part 17 of the disk subsystem 3 starts the 
processing based on the write command. Here, the write 
command indicates a command to transfer the instruc- 
tion to write the data into the cache memory 12 and the 
write data itself. The disk subsystem control part 17 of 
the disk subsystem 3 obtains the time of the receipt of 
the write command from the clock 1 5 (step 32), stores 
the data into the cache memory 12 (step 33), and stores 
the management information of the data into the control 
memory 14 (step 34). The management information in- 
cludes the address information of a writing address of 
the magnetic disk drive 13, the write command receipt 
time of the data, the serial number of the receipt of the 
write command, and a pointer to the data on the cache 
memory 12. Here, the serial number of the receipt of the 
write command is a series of numbers given to the write 
commands received by the disk subsystem 3. Then, the 
completion of the processing corresponding to the re- 
ceived write command is reported to the host unit 1 (step 
35). The data stored into the cache memory 12 is re- 
corded into the magnetic disk drive 13 afterwards; how- 
ever, the detail is not described here as it is of the prior 
art. 

[0033] The disk subsystem control part 1 7 issues the 
write command (step 37) to the connected disk subsys- 
tem 7 through the interface control part 11 and the in- 
terface cable 4 asynchronous with the processing cor- 
responding to the write command from the host unit 1 , 
when the remote copy is not in the suspending mode 
(step 36 No) referring to a control bit on the control mem- 
ory 14. Here, the write command includes a command 
to transfer an instruction to write the data and the write 
data itself as is similar with the write command received 
from the host unit 1 and, further, includes the receipt time 
of the data and the serial number of the data receipt. 
When the remote copy is in the suspending mode (step 
36 Yes), the address information of the data writing ad- 
dress of the magnetic disk drive 1 3 is stored into the 
control memory 14 inside the own disk subsystem 3 
(step 38) and the write command is issued to the disk 
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subsystem 7 of other party after the remote copy retums 
to a normal mode. 

[0034] The disk subsystem 7 receives the write com- 
mand issued from the disk subsystem 3 through the in- 
terface control part 11 (step 41 ). The write command in- 
cludes the time of receipt of the data by the disk sub- 
system of the main center and the serial number of the 
receipt of the data by the disk subsystem of the' main 
center adding to the data. The disk subsystem 7 starts 
the processing according to the received write com- 
mand. The disk subsystem control part 17 of the disk 
subsystem 7 rearranges the received write commands 
in sequence of the serial number of the data receipt and 
checks the missing of the serial number, i.e. the missing 
of write commands. Then, the received data are stored 
into the cache memory 12 as "provisionary data", the 
management information of the data is stored into the 
control memory 14. and the completion report of the da- 
ta receipt is sent to the disk subsystem 3 of other party 
(step 42). 

[0035] Next, the disk subsystem control part 1 7 of the 
disk subsystem 7 selects candidates to determine the 
justified time among the data based on the time con- 
tained in the received write commands and decides the 
candidates of the justified time cooperating with other 
disk subsystem 7 (step 43). The decision of the candi- 
dates of the justified time will be explained later. When 
the justified time is decided in the manner as described 
above, the data having the receipt time not later than 
the justified time are stored into the cache memory 12 
as "proper data" (step 44). The data stored into the 
cache memory 12 are recorded into the magnetic disk 
drive 13 afterward. 

[0036] Fig. 4 is a block diagram which explains with 
an example the flow of the data by the time the data 
given with the time of receipt and the serial number of 
receipt are remote copied. The host unit 1 being gener- 
ated with five write data of D1, 02, D3. D4, and D5 in 
this sequence issues the write commands to the disk 
subsystem 3-1 and the disk subsystems 3-2 sequential- 
ly. The disk subsystem 3-1 receives the data of D1 , D3 
and D5, and gives the serial numbers of receipt and the 
time of receipt which areSi/TI, S2/T3, and S3/T5 to the 
data. On the other hand, the disk subsystem 3-2 re- 
ceives the data of D2 and D4, and gives the serial num- 
bers of the receipt and the time of receipt which are 
S1/T2 and S2/T4 to the data. The disk subsystem 3-1 
issues write commands on each of the data to the disk 
subsystem 7-1 after storing the received data into the 
cache memory 1 2 in the sequence of the serial num- 
bers. The disk subsystem 3-2 issues write commands 
to the disk subsystem 7-2 on each of the data after stor- 
ing the received data into the cache memory 12 in the 
sequence of the serial numbers. 
[0037] The disk subsystem 7-1 receives the data giv- 
en with the serial numbers of receipt and the time of the 
receipt, and stores the data as the "provisional data" into 
the cache memory 12. The disk subsystem 7-2 receives 
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the data given with the serial numbers of receipt and the 
time of the receipt, and stores the data as the "provision- 
al data" into the cache memory 12. Next, the time of the 
latest data D5 among the data being received by the 

5 disk subsystem 7-1 and the time of the latest data D4 
among the data received by the disk subsystem 7-2 are 
compared between the disk subsystem 7-1 and the disk 
subsystem 7-2. Here, the data D5 and the data D4 are 
called as the candidates of the justified time. In this ex- 

10 ample, the time T4 given to the D4 is judged to be older 
than the time T5 given to the D5, so that the time T4 
given to the data D4 is decided to be the justified time 
and the data D2 and D4 given with time not later than 
the justified time are reflected to the cache memory 12 

15 as the "proper data". The data D1 and D3 given with 
time not later than the time T4 are reflected to the cache 
memory 12 as the "proper data". 
[0038] As the result of above described processing, 
the data D5 will remain as the "provisional data" and the 

20 data D5 will be reflected to the cache memory 1 2 at least 
after the reflection of the data D4 to the cache memory 
1 2. As described above, by keeping the sequential order 
of the updates of the data D4 and D5, the recovery of 
the data and the resuming of the remote copy are ena- 

25 bled at the suspension of the remote copy caused by a 
failure. On the contrary, if the failure occurs after the data 
D5 is updated and reflected preceding the update and 
reflection of the data D4, generally there will be the pos- 
sibility of data loss and the data recovery and the resum- 

30 ing of the remote copy will become difficult. On the other 
hand, the update sequences of the data D1 and D3 of 
the disk subsystem 7-1 . and the data D2 and D4 of the 
disk subsystem 7-2, of which given time stamps are jus- 
tified, will be the sequences of updates in the order of 

35 the data D1 and D3 of the disk subsystem 3-1 and the 
data D2 and D4 of the disk subsystem 3-2, and the se- 
quential order of the updates can be maintained. 
[0039] As above described, the serial numbers of re- 
ceipt is the sequence of the updates inside the one disk 

40 subsystem 3. and, therefore, the numbers indicating the 
sequence of updates inside the one disk subsystem 7. 
The serial numbers of receipt are used also for the check 
of the data dropout. On the contrary, the time of the re- 
ceipt is the numerical value which shows the sequence 

45 of updates covering two or more disk subsystems 3 and 
used for the decision of the Justified time covering two 
or more disk subsystems 7. 

[0040] Fig. 5 is a flow chart showing the procedure of 
processing corresponding to the decision of the justified 

50 time in the disk subsystem 7. The disk subsystem con- 
trol part 17 of the disk subsystem 7 which issues the 
decision request of the candidates of the justified time 
selects the candidates of the justified time in the own 
subsystem (step 51 ). That is, the write command given 

55 with the latest time of the receipt among the write com- 
mands arranged in the sequence of the serial numbers 
of the data receipt is selected. Then, the command is 
issued requiring the decision of the selected candidates 
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of the justified time to the adjacent disk subsystem 7 
which is connected by the interface cable 5 (step 52). 
This command includes the request indicating the deci- 
sion request, the time given to the candidates of the jus- 
tified time, and the unit production number (manufactur- 5 
ing number) of the disk subsystem 7. Furthermore, the 
time of the decision request is memorized inside the own 
control memory 14. 

[0041] The adjacent disk subsystem 7 receives this 
command and judges whether the manufacturing 
number in the command meets the manufacturing 
number of the own unit (step 53). If the number is not 
the manufacturing number of the own unit (step 53 No), 
the candidate of the justified time in the own unit is se- 
lected according to the above described processing 
(step 54). Next, the received time of the candidates of 
the justified time and the time of the candidate of the 
justified time of the own unit are compared and judged 
(step 55). If the time of the own unit is older (step 55 
Yes), the time and the manufacturing number of the can- 
didates of the justified time that is the object of the de- 
cision are replaced with the time and manufacturing 
number of the candidate of the justified time of the own 
unit (step 56). If the time of the own unit is newer (step 
55 No), jump to step 57. Then, the command is issued 
requesting the decision of the candidates of the justified 
time, of which content is replaced with the content of the 
own unit or as is received, to the adjacent disk subsys- 
tem 7 connected by the interface cable 5 (step 57). At 
this, the time of the decision request is memorized inside 
the own control memory 14. After the processing of the 
step 57, return to the step 53. 

[0042] Receiving the decision request command and 
the manufacturing number in the command which meets 
with the manufacturing number of the own unit (step 53 
Yes), the decision requested candidates of the justified 
time is decided as the justified time (step 58). That is, 
the candidate of the justified time is selected in each unit 
of the disk subsystems 7 and the oldest time among 
them is decided as the justified time. 
[0043] Fig. 6 is a flowchart showing the procedure of 
processing by the disk subsystem 7 regarding to the no- 
tification of the justified time. The disk subsystem 7 that 
has decided the justified time stores the data given with 
the time not later than the time of the justified time as 
"proper data" into the cache memory 1 2 (step 61 ). Next, 
a command informing the decided justified time is is- 
sued to the adjacent disk subsystem 7 (step 62). The 
command includes a request indicating the notification 
of the justified time, the decided justified time, and the 
manufacturing number of the disk subsystem 7. At this 
moment, the time of issuing the notification command is 
memorized into the own control memory 14, 
[0044] Receiving the command, the adjacent disk 
subsystem 7 judges whether the manufacturing number 
in the command meets with the manufacturing number 
of the own unit (step 63). If It Is not the manufacturing 
number of the own unit (step 63 No), the data received 



from the disk subsystem 3 and given with the time not 
later than the justified time are stored into the cache 
memory 1 2 as the "proper data" (step 64). Then, the jus- 
tified time is notified to the adjacent disk subsystem 7 
by transmitting the received command as it Is received 
(step 65) and step 63 is resumed. 
[0045] Receiving the justified time notifying command 
and the manufacturing number in the command which 
meets with the manufacturing number of the own unit 
(step 63 Yes), the infonnation as the manufacturing 
numbers which became useless in the data given with 
the time not later than the justified time are erased from 
the control memory 14 (step 66). As the result of above 
processing, all of the write commands given with the 
time not later than the justified time over the two or more 
disk subsystems 7 are reflected to the data updates in 
each of the disk subsystems 7 and the new write com- 
mands of later than the justified time will remain as the 
object of selection for the next candidates of the justified 
time. 

[0046] As for the start up of the decision request of 
the justified time, there is a method that a first start up 
disk subsystem 7 is detemnined beforehand and then, 
the disk subsystem 7 which is decided with the justified 
time starts up the next decision request of the justified 
time thereafter. There is another method that one of the 
disk subsystems 7 starts up the decision request of the 
justified time periodically. Any method that executes the 
decision of the justified time and the notification of the 
justified time periodically will be acceptable. 
[0047] In the above described preferred embodiment, 
the product serial number is used as an identifier to dis- 
criminate each of the disk subsystems 7; however, it is 
not limited to the product serial number. Any identifier 
which can discriminate each of the disk subsystems 7 
of the remote center can be used as the identifier. 
[0048] In case the decision request command of the 
justified time issued by oneself does not return or the 
notification of the justified time is not received within the 
predetermined time after the issue of the decision re- 
quest of the justified time, or in case the notification of 
the justified time issued by oneself does not return within 
the predetermined time after the issue of the notification 
of the justified time, the disk subsystem 7 is judged to 
be not able to have completed the decision of the time 
or the notification of the time caused by the occurrence 
of some failures and sets the control bit of the control 
memory 14 to the remote copy suspension mode to sus- 
pend the remote copy. As above described, when the 
remote copy cannot be continued by the failures on the 
side of the disk subsystem 7, the remote copy is sus- 
pended and the suspension of the remote copy is noti- 
fied to the disk subsystem 3 of other party. Receiving 
the notification, the disk subsystem 3 sets the control bit 
of the control memory 1 4 to the remote copy suspension 
mode to suspend the remote copy and holds the above 
described transmission of the write command received 
from the host unit to the disk subsystem 7. When the 
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remote copy cannot be continued by the failures on the 
side of the disk subsystenn 3, the remote copy is sus- 
pended similarly and the suspension of the remote copy 
is notified to the disk subsystem 7 of other party. The 
disk subsystem 3 and the disk subsystem 7 detected 5 
the interrupted communication between the disk sub- 
system 3 and the disk subsystem 7 caused by the failure 
of the interface cable 4 suspend the remote copy simi- 
larly. At any time point, changing the control bit of the 
control memory 14 by the instruction from the service 
processor panel 1 8 of the disk subsystem 3 or the disk 
subsystem 7, the remote copy can be set to suspension 
mode, or the remote copy can be restarted by releasing 
the suspension mode. 

[0049] The temporary suspension or resuming of the 
remote copy can be specified with a unit of a volume 
pair that is the minimum management unit of the mem- 
ory region inside the disk subsystem at setting of the 
remote copy. Here, the volume pair indicates a pair of a 
volume inside a disk subsystem 3 which is the source 
of copying and a corresponding volume inside the disk 
subsystem 7 which is the destination of copying. Chang- 
ing the state with the unit of a volume group is enabled 
by defining at least one volume pair as a volume group. 
Here, it is assumed that the data update inside the vol- 
ume group does not affect other volume groups. In this 
case, the control bits in the control memory 14 are ar- 
ranged for each of volume pairs or volume groups. The 
disk subsystem 3 judges whether the remote copy of the 
data is in suspended mode or not by referring to the vol- 
ume or the volume group to store the data, and to the 
corresponding control bit of the control memory 14 at 
step 36. 

[0050] Therefore, for example, in a system, wherein 
all of the data of the disk subsystem storing two or more 
business data are always duplicated, and in case there 
is a purpose on the side of remote center to use the data 
at the state of the end of the business regarding to a 
certain business, the business can be performed at the 
remote center, as the data at the end point state of the 
business can be maintained in the disk subsystem 7 of 
the remote center by setting the remote copy of the vol- 
ume pair or the volume group, wherein the business da- 
ta at the end point of the business are stored, to the tem- 
porary suspension mode. 

[0051] When the temporary suspension mode is re- 
leased, the disk subsystems 3 and 7 resume the remote 
copy and the write commands are transferred to the disk 
subsystem 7 for the data, which were written only into 
the disk subsystem 3 of the main center and the data, 
which is not completed with the justification in the disk 
subsystem 7 of the remote center during the temporary 
suspension. With this, the state, in which the regular re- 
mote copy can be executed, is restored. 
[0052] Above described preferred embodiment of the 
present invention is a execution mode with the system 
configuration, wherein the disk subsystem 3 on the side 
of the main center 9 and the disk subsystem 7 on the 



side of the remote center 10 are connected with the cor- 
respondence of one to one. However, as the other pre- 
ferred embodiment of the present invention, the system 
can be configured as two or more disk subsystems 3 are 
connected to one disk subsystem 7 and the remote copy 
is processed with the two or more disk subsystems 3, 
which are connected with this disk sub system 7. In this 
configuration, the disk subsystem 7 arranges the re- 
ceived write commands in sequence of the serial num- 
bers of the data receipt for each production unit number 
of each of the disk subsystems 3 at receiving data in 
step 41 , selects the latest time out of the time given to 
the write command of each of the disk subsystems 3, 
decides the oldest time as the candidates of the justified 
time comparing between the selected latest time and the 
time selected among the disk subsystems 3. and de- 
cides the justified time as described above. 
[0053] As the other preferred embodiment of the 
present invention, the system can be configured as any 
of the disk subsystems 7 of the remote center is set as 
a master disk subsystem (generally, a master storage 
subsystem), and each of the disk subsystems in the re- 
mote center 10 other than the master disk subsystem is 
connected with the master disk subsystem mutually. In 
this configuration, the master disk subsystem acquires 
the candidates of the justified time held by the other disk 
subsystem itself of the remote center 1 0 by inquiring it. 
Then, the master disk subsystem decides the justified 
time by deciding the oldest time as the justified time 
comparing between the acquired candidate of the justi- 
fied time and the candidate of the justified time owned 
by itself, and notifies the decided justified time to other 
disk subsystems of the remote center. Each of the disk 
subsystems 7 executes the Justification of each of the 
data. 

[0054] With this preferred embodiment of the present 
invention, executing the decision of the candidates of 
the justified time and the justification of the data accord- 
ing to the justified time by the disk subsystem 7 of the 
remote center 1 0, asynchronous remote copy is realized 
maintaining the consistency of the update sequence 
with the function of the disk subsystem, without the in- 
tervention of the host unit, without the processing peri- 
odically interrupting the remote copy, without arranging 
gateway subsystems to each of the main center 9 and 
the remote center 10 for maintaining the update se- 
quence, and without the deterioration of the processing 
performance of the disk subsystem 3 of the man center 
9. 

[0055] According to the preferred embodiment of the 
present invention, the consistency of the update data is 
assured utilizing the time acquired from the clock held 
by the disk subsystem 3; therefore, asynchronous type 
remote copy can be realized without the intervention of 
the host unit, even with the system in which the data 
transferred from the host unit 1 to the disk subsystem 3 
are not given with the time information. 
[0056] The data are recorded in the disk subsystem 
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7 of the remote center 10 with the state maintaining the 
consistency of the data, if the function of the main center 
9 is interrupted by the disasters of the main center or by 
the failures of devises. At) of these are realized by the 
functions of the disk subsystems 3 and 7, and will not 
be a load to the processing capacity of the host unit 1. 
When the main center 9 suffers from disasters, the busi- 
ness can be restarted by the recovery worl< of such as 
the reexecution of jobs utilizing the data of the disk sub- 
systems 7. 

[0057] As above described, the present invention 
does not require the introduction of new software to the 
host unit, can assure the consistency of the update data 
in the range users expect with the functions of storage 
subsystems, and can realize the asynchronous type re- 
mote copy, which are easy to introduce and without the 
deterioration of the processing performance of the host 
computer. 



Claims 

1 . The method of duplicating data in a system provid- 
ed with a first storage subsystem group comprising 
two or more storage subsystems and a second stor- 
age subsystem group comprising two or more stor- 
age subsystems storing the copies of the data of 
the first storage subsystem group, wherein each of 
the storage subsystems which belongs to the first 
storage subsystem group writes the data into a stor- 
age device of the storage subsystem giving a serial 
number and a time, transfers the data through a 
transfer line to the storage subsystem which be- 
longs to the second storage subsystem group, two 
or more data received by each of the storage sub- 
systems which belongs to the second storage sub- 
system group are arranged in sequence of the serial 
numbers, the oldest time is decided among the 
kinds of latest time given to each of the storage sub- 
systems with the communications among the stor- 
age subsystems which belong to the second stor- 
age subsystem group, and the data given with the 
time not later than the decided oldest time are de- 
cided as the object of data writing to the storage de- 
vice of each of the storage subsystems. 

2. The method of claim 1 , wherein when a storage de- 
vice of said storage subsystem is comprised of two 
or more volumes and a volume pair is comprised of 
a volume which belongs to the first storage subsys- 
tem group which is the source of copying and a vol- 
ume which belongs to the second storage subsys- 
tem group which is the destination of copying, the 
storage subsystem which belongs to the first stor- 
age subsystem group controls the start and the stop 
of the data transmission to the second storage sub- 
system group with the unit of each of the volume 
groups comprised of two or more volume pairs. 



3. A data duplicating system, wherein the data dupli- 
cating system is provided with a first storage sub- 
system group comprising two or more storage sub- 
systems and a second storage subsystem group 

5 comprising two or more storage subsystems which 

memorise the copies of the data of the first storage 
subsystem group, wherein each of the storage sub- 
systems which belongs to the first storage subsys- 
tem group is provided with the means of writing data 

10 to a storage device of the storage subsystem, the 
means of giving a serial number and a time to said 
data, the means of transmitting said data given with 
the serial number and the time through a transmis- 
sion line to the storage subsystem which belongs 

15 to the second storage subsystem group, the means 
with which each of the storage subsystems which 
belongs to the second storage subsystem group ar- 
ranges the received two or more data in sequence 
of the serial numbers, the means for deciding the 

20 oldest time among the kinds of latest time given to 
each of the storage subsystems by the communi- 
cation among the storage subsystems which belong 
to the second storage subsystem group, and the 
means of writing the data given with the time not 

25 later than the decided oldest time to the storage de- 
vice of each of the storage subsystems. 

4. The system of claim 3, wherein the storage devices 
of said storage subsystem comprises two or more 

30 volumes and when a volume which belongs to the 
first storage subsystem group which is the source 
of copying and a volume which belongs to the sec- 
ond storage subsystem group which Is the destina- 
tion of copying form a volume pair, the storage sub- 

35 system which belongs to the first storage subsys- 
tem group is provided with the means for controlling 
the start and stop of the data transmission to the 
second storage subsystem group with the unit of a 
volume group comprising two or more volume pairs. 

40 

5. The invention of any preceding claim, wherein said 
transmission line connecting between a storage 
subsystem which belongs to said first storage sub- 
system group and a storage subsystem which be- 

45 longs to said second storage subsystem group is 
comprised of Storage Area Network (SAN). 

6. The invention of any preceding claim, wherein a 
clock for referring to said time by each of the storage 

50 subsystems which belong to said first storage sub- 
system group is corrected by time information from 

outside. 

7. The invention of any preceding claim, wherein con- 
55 nections among the storage subsystems which be- 
long to said second storage subsystem group are 
made by loop transmission lines, and each of the 
storage subsystems transfers the older time out of 
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said own latest time and said received latest time 
to adjacent storage subsystem, and decides the 
time transferred by the own storage subsystem and 
returned to the own storage subsystem as said old- 
est time. 

8. The invention of any preceding claim, wherein one 
of the two or more storage subsystems which be- 
long to said second storage subsystem group is set 
as a master storage subsystem, each of storage 
subsystems other than the master storage subsys- 
tem notifies said latest time to the master storage 
subsystem, and the master storage subsystem de- 
cides said oldest time out of own latest time and re- 
ceived latest time. 

9. The invention of any preceding claim, wherein two 
or more storage subsystems which belong to said 
first storage subsystem group transfer each of writ- 
ing data to one of the storage subsystems which 
belong to said second storage subsystem group, 
the one of the storage subsystems which belong to 
the second storage subsystem group selects the 
latest time that is given to each of said storage sub- 
systems which belongs to the first storage subsys- 
tem group and decides the oldest time out of the 
selected latest time as the candidates of said oldest 
time. 

1 0. A storage subsystem, wherein a storage subsystem 
belongs to a storage subsystem group comprising 
two or more storage subsystems and said storage 
subsystem being provided with a first means of writ- 
ing data received from outside to the storage device 
of the storage subsystem, a second means of trans- 
mitting said data being given a serial number and a 
time to other storage subsystems, a third means of 
arranging two or more data received from other 
storage subsystems in sequence of said serial num- 
bers, and a fourth means of deciding the oldest time 
among the kinds of latest time given to each of the 
storage subsystems by the communication among 
the other storage subsystems, operates the first 
means and the second means when said storage 
subsystem is in local mode, and decides the data 
given with the time not later than the decided oldest 
time as the object of writing data by the first means 
operating the third means, the fourth means, and 
the first means at the remote mode. 

11. The storage subsystem of claim 10, wherein said 
storage subsystem operating in said local mode is 
connected with said storage subsystem operating 
in said remote mode through Storage Area Network 
(SAN). 

12. The storage subsystem of claim 10, wherein said 
storage subsystem is provided with the means for 



correcting a clock which is for referring to said time 
by time information from outside. 

13. The storage subsystem of claim 10, wherein a loop 
5 transmission line connects among said storage 

subsystems operation in said remote mode and 
each of the storage subsystems in said remote 
mode is provided with the means of transmitting the 
older time among said own latest time and received 
10 said latest time to an adjacent storage subsystem 
and the means of deciding the time transferred by 
the own storage subsystem and returned to the own 
storage subsystem as said oldest time. 

15 1 4. The storage subsystem of claim 1 0, wherein one of 
said storage subsystems which operates in said re- 
mote mode is arranged as a master storage sub- 
system, each of the storage subsystems other than 
said master storage subsystem is provided with the 
20 means for notifying said latest time to said master 
storage subsystem and said master storage sub- 
system is provided with the means of deciding said 
oldest time out of own latest time and acquired lat- 
est time. 

25 

1 5. The storage subsystem of claim 1 0, wherein two or 
more storage subsystems which operate in said lo- 
cal mode are structured to transfer the data to one 
of the storage subsystems belonging to the storage 
30 subsystem group which operates in said remote 
mode, and said one of the storage subsystems be- 
longing to the storage subsystem group which op- 
erates in said remote mode is provided with the 
means of selecting the latest time given to each of 
35 the storage subsystems which operates in said lo- 
cal mode and the means of deciding the oldest time 
among the selected latest times as the candidates 
of said oldest time. 

40 16. The storage subsystem of claim 10. wherein the 
storage device of said storage subsystems com- 
prises two or more volumes and when a volume be- 
longing to the storage subsystem which is the 
source of copying operating in said local mode and 

45 a volume belonging to the storage subsystem which 
is the destination of copying operating in said re- 
mote mode form a volume pair, the storage subsys- 
tem operating in said local mode is provided with 
the means of controlling the start and stop of the 

50 data transmission to the storage subsystem oper- 
ating in said remote mode for each of volume 
groups comprising one or more volume pairs. 
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